Application of finite-state transducers to the acquisition of verb subcategorization information

نویسندگان

  • Izaskun Aldezabal
  • María Jesús Aranzabe
  • Koldo Gojenola
  • Maite Oronoz
  • Kepa Sarasola
  • Aitziber Atutxa
چکیده

This paper presents the design and implementation of a finite-state syntactic grammar of Basque that has been used with the objective of extracting information about verb subcategorization instances from newspaper texts. After a partial parser has built basic syntactic units such as noun phrases, prepositional phrases, and sentential complements, a finite-state parser performs syntactic disambiguation, determination of clause boundaries and filtering of the results, in order to obtain a verb occurrence together with its associated syntactic components, either complements or adjuncts. The set of occurrences for each verb is then filtered by statistical measures that distinguish arguments from adjuncts.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Connectionist Model of Verb Subcategorization

Much of the debate on rule-based vs. connectionist models in language acquisition has focussed on the English past tense. This paper investigates a new area, the acquisition of verb subcategorization. Verbs differ in how they express their arguments or subcategorize for them. For example, “She gave him a book.” is good, but “She donated him a book.” sounds odd. The paper describes a connectioni...

متن کامل

A Bootstrapping Approach to Parser Development

This paper presents a robust parsing system for unrestricted Basque texts. It analyzes a sentence in two stages: a unification-based parser builds basic syntactic units such as NPs, PPs, and sentential complements, while a finite-state parser performs syntactic disambiguation and filtering of the results. The system has been applied to the acquisition of verbal subcategorization information, ob...

متن کامل

Bengali Verb Subcategorization Frame Acquisition - A Baseline Model

Acquisition of verb subcategorization frames is important as verbs generally take different types of relevant arguments associated with each phrase in a sentence in comparison to other parts of speech categories. This paper presents the acquisition of different subcategorization frames for a Bengali verb Kara (do). It generates compound verbs in Bengali when combined with various noun phrases. ...

متن کامل

Semitic Morphological Analysis and Generation Using Finite State Transducers with Feature Structures

This paper presents an application of finite state transducers weighted with feature structure descriptions, following Amtrup (2003), to the morphology of the Semitic language Tigrinya. It is shown that feature-structure weights provide an efficient way of handling the templatic morphology that characterizes Semitic verb stems as well as the long-distance dependencies characterizing the complex...

متن کامل

Learning Subcategorization

A method to identify the subcategorized constituents of a verb (its complements) automatically in a sentence is useful in various areas of Natural Language Processing (e.g. automatic acquisition of subcategorization lexicons, parsing, acquisition of verb semantics, information retrieval). I will describe a method for subcategorization identification that uses memorybased learning. Train and tes...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Natural Language Engineering

دوره 9  شماره 

صفحات  -

تاریخ انتشار 2003